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A SYSTEM FOR MONITORING NON-COINCIDENT, 
NONSTATIONARY PROCESS SIGNALS 

This invention was made with government support under Contract No. W-31- 

109-ENG-38 awarded to the Department of Energy. The Government has certain rights in this 

invention. 

BACKGROUND OF THE INVENTION 

This invention relates generally to a system for monitoring non-coincident, 
nonstationary process signals. More particularly, this invention relates to a system for 
monitoring non-coincident, nonstationary process signals used in detecting deficiencies in 
various stages of manufacturing processes, biological process and the like. 

There is often a need or desire to monitor finite length, non-stationary signals 
that may include repetitive deterministic artifacts that are non-coincident in time. This 
phenomenon occurs, for example, in many engineering systems that contain moving parts that 
are monitored by digitizing sensors monitoring signals relevant to the quality of those parts. 

For example, an assembly line where the thickness of manufactured plastic or 
metal components might be measured. In such an example, every component passing through 
the sensor produces a signal that has a shape that is substantially similar to the preceding 
signal— but the signal may be longer or shorter depending upon the speed of the conveyor belt. 
Another example would be the force applied to the die set in a metal stamping machine. Once 
again, a signal representing this force would possess a similar shape with every repetition of 
the machine's movement. The length of the force signal, however, may be longer or shorter 
depending upon how fast the machine is operating. Biological signals may also produce 



signals with repetitive deterministic artifacts. One such example includes the use of cardiac 
signals monitored from EKG traces. 

In each of the foregoing cases, if one were to digitize and then plot the 
monitored signals, the length of the repetitive deterministic artifacts would vary from part to 
part or from cycle to cycle, depending upon the speed and variability of the system or 
organism being monitored. A reference signal can often be used to compare to these repetitive 
signal waveforms for detection of anomalies, but only if their lengths are exactly the same. If 
their lengths are not the same, large discrepancies between the reference signal and the input 
signal would be seen due to the signals not being coincident. Such discrepancies could result 
in an erroneous diagnosis. 

SUMMARY OF THE INVENTION 

It is therefore an object of the invention to develop an improved method for 
monitoring non-coincident and non-stationary process signals. 

It is a further object of the invention to develop an improved system for 
monitoring non-stationary, non-coincident process signals of a definite length. 

It is yet another object of the invention to develop an improved system for 
monitoring non-coincident, non-stationary process signals that correspond to a manufacturing 
process. 

It is yet another object of the invention to develop an system for monitoring 
non-coincident, non-stationary process signals that correspond to a biological process. 

In accordance with the above objects, a system is provided including a series of 
steps for developing a reference and for characterizing an input signal or signals for 
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meaningful comparison with the reference. The first step includes the use of a training 
sequence for determining a mean and variance of a reference wave form and to define a 
reference wave form length. The leading and falling edges of the repetitive deterministic 
artifacts are determined in the monitored signal and to calculate the sample length. The 
monitored signal is then resampled to properly correlated with the reference signal, and the 
two signals are arranged such that they are coincident in time. The monitored signal is then 
shifted with respect to the sequence signal so that the monitored signal has the same number of 
samples as the reference length identified in the first stsep. The adjusted monitored signal is 
then compared to the stored reference signal. 

These and other objects, advantages and features of the invention together with 
the organization and manner of operation thereof will become apparent from the following 
detailed description when taken into conjunction with the accompanying drawings wherein like 
elements have like numerals throughout the drawings described below. 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a depiction of the overall operation of an example system of the 
present invention; 

FIGURE 2 is an illustration of detailed training module steps for the system of 
FIG. 1 according to one embodiment; 

FIGURE 3 illustrates the detailed monitoring module steps for the system of 

FIG. 1; 

FIGURE 4 A shows an example of a raw data set from a repetitive process; and 
FIG. 4B shows the raw data sequence smoothed using the Savitzky-Golay filter; 



FIGURES 5A-5D show each of the identified signatures as they have been 
identified and extracted from the original data stream; 

FIGURE 6 shows the basic methodology for re-sampling using a digital 
fractional re-sampling filter; 

FIGURE 7 is a depiction of the logic diagram for an expert pump-surveillance 
system operated in accordance with an embodiment of the invention; 

FIGURE 8 is a representation of an expert system for online surveillance of a 
set of nuclear reactor coolant pumps; 

FIGURE 9 is an illustration of a flow diagream of a bounded angle ratio test 
method of data analysis; 

FIGURE 10 is an illustration of conditions and values for carrying out a 
bounded angle ratio test; 

FIGURE 1 1 is an illustration of conditions for comparing similarity of two 
points Xq and X } on the illustration of FIGURE 10; 

FIGURE 12 shows EBR-II channel 1, primary pump 1, power under normal 
operational conditions, and modelled BART; 

FIGURE 13 shows EBR-II channel 2, primary pump 2, power under normal 
operational conditions, and modelled BART; 

FIGURE 14 shows EBR-II channel 3, primary pump 1, speed under normal 
operating conditions and modelled BART; 

FIGURE 15 shows EBR-II channel 4, primary pump 2, speed under normal 
operating conditions and modelled BART; 
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FIGURE 16 shows channel 5 reactor outlet flow rate under normal operating 
conditions and modelled BART; 

FIGURE 17 shows EBR-II channel 6, primary pump 2, flow rate under normal 
conditions and modelled BART; 

FIGURE 18 shows EBR-II channel 7 subassembly outlet temperature 1A1 under 
normal operating conditions and modelled BART; 

FIGURE 19 shows channel 8 subassembly outlet temperature 2B11 under 
normal operating conditions and modelled BART; 

FIGURE 20 illustrates channel 9 subassembly outlet temperature 4E1 under 
normal operating conditions; 

FIGURE 21 illustrates channel 10 subassembly outlet temperature 4F1 under 
normal operating conditions and modelled BART; and 

FIGURE 22A shows an EBR-II primary pump power signal with an imposed 
positive drift; FIGURE 22B shows an application of SPRT to the signal of FIGURE 22 A; 
FIGURE 22C shows an EBR-II primary pump power signal with an imposed positive step 
function; FIGURE 22D shows an application of SPRT to the signals of FIGURE 22C; 
FIGURE 22E shows an EBR-II primary pump power signal with an imposed sinusoidal 
disturbance; FIGURE 22F shows an application of SPRT to the signal of FIGURE 22E. 

DETAILED DESCRIPTION OF THE INVENTION 

In order to illustrate embodiments of the invention, wherein the monitored 
signal and the reference signal comprise a repetitive waveforms, an explanation is provided to 
generally describe the methodology and function for the systematic procedure of the invention 
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and then the stepwise algorithmic approach is presented in detail. Although the manner in 
which the phenomena are described is one rigorous approach which explains the operation of 
the invention for those skilled in the art, other conventional mathematical and theoretical 
explanations can also be used to describe similar results which characterize embodiments of the 
invention. The invention is therefore not limited to the description of its operation by the 
following illustrative mathematical explanations. 

The present invention involves the use of a step-wise procedure for monitoring a 
plurality of repetitive signals. FIG. 1 depicts the overall operation of one embodiment of a 
system of the present invention. The system runs on a computer or is embedded into 
monitoring hardware 201. Before data are analyzed using the system, a training data source 
must be selected, shown at 202. The selection can be an on-line or real-time source 204, or it 
can be a storage media source 203. Once the source has been selected, data are collected for 
building the trained reference patterns, shown at 205, and the results are stored at 206. The 
training data are fed into the training module, shown at 208 and all pertinent parameters and 
reference patterns are calculated. 

After the training process completes steps 201-206, the data source for 
monitoring is selected, shown at 207. Again, the selection can be an on-line or real-time 
source, shown at 204, or it can be a storage media source, shown at 203. Next, data are 
acquired for analysis, shown at 208, from the chosen source. The acquired data are fed as 
input to the system monitoring module, shown at 209, which determines whether or not the 
input data are deviating from the trained normal conditions. The results from the monitoring 
module are directed at 210 to one or both of a data logging system, shown at 21 1, and/or a 
terminal display or monitoring indication mechanism, represented at 212. 
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FIG. 2 illustrates the detailed training module steps for one form of a system of 
the present invention. First, a training data source is selected at 213. The selection can be an 
on-line or real-time source, shown at 215, or it can be a storage media source, shown at 214. 
The data are read into memory either via data acquisition (DAQ) hardware or from storage 
media at 216 depending on the choice made in step 213. 

The first data processing step, shown at 217, is a method for determining the 
leading and trailing edges of each individual signature in the input. An example of this 

p procedure is illustrated in FIGS. 4A-4B and 5A-5D. FIG. 4 A shows an example of a raw data 
set from a repetitive process. The threshold is used to mark the leading edge and trailing edges 
of each of the four signatures in the data sequence. Because the data is noisy, unique 

|d identifiers for the leading and trailing edges are impossible to find using this threshold. One 

u method of overcoming this problem, however, is illustrated in FIG. 4B. The raw data 

if! 

j~ sequence is smoothed using a well-known smoothing algorithm called the Savitzky-Golay 
rl filter. Much of the noise is suppressed using the Savitzky-Golay filter so that the threshold can 
be used effectively to identify the leading and trailing edges of each signature. The markers in 
FIG. 4B show where each of the edges was identified. FIGS. 5A-5D show each of the 
identified signatures as they have been identified and extracted from the original data stream. 

The next step in the training procedure is to store a plurality of identified 
signatures in computer or embedded memory 218. As each signature is extracted from the 
training data set its sample length is measure and stored as well, shown at 219. Then a 
reference length N ref is calculated from all of the measured signature lengths at 220. The 
reference length can be determined from the minimum, maximum, median or mean of the 
plurality of measured signature lengths. 
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The reference length N ref is used to determine the re-sampling rate applied to 
each stored signature so that the lengths of all signatures are the same, represented at 221. The 
re-sampling is accomplished using a digital fractional re-sampling filter. The basic structure of 
the filter is shown in FIG. 6. If the raw input signature or data sequence representated by x(n) 
at 243 has an original length of N, then the signature is re-sampled using the re-sampling filter 
to produce a new signature of length N ref . First x(n) is fed through an expander at 244 that 
inserts N ref zeros between each original sample. Then a low-pass anti-aliasing filter 245, is 
applied to the resulting zero padded data sequence acting as an interpolator. The interpolated 
sequence is then decimated at 246 by a factor of N to produce the desired length of N ref for the 
output signature y(n). In cases where N and N ref are large, may be more efficient to first 
simplify the ratio N ref /N to their equivalent ratio of smallest integer (i.e., 40/30 = 4/3). 

In step 222 the re-sampled signatures are padded on both sides with a plurality 
of zeros. Each new re-sampled signature is compared with all previously processed signatures 
using a vector similarity calculation defined to be between 0 and 1 (1 for identical and 0 for no 
similarity) at step 223. The new signature is shifted forward and/or backward until the 
similarity is maximized, ensuring that the signatures optimally line up with one another. After 
the signatures have been lined up, the extraneous samples on both ends of the signature are 
removed at 224. 

The next step in the training process is to calculate the mean and standard 
deviation for each sample in the N ref length signatures producing N ref mean values and N ref 
standard deviation values, shown at 225. The parameter N ref and the vectors of mean values 
and standard deviation values are stored for use during the monitoring phase of operation at 
226 and the training is completed at step 227. 
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FIG. 3 illustrates the detailed monitoring module steps for the system of the 
present invention. First, a monitored data source is selected at 228. The selection can be an 
on-line or real-time source, shown at 230, or it can be a storage media source, represented at 
229. Next the training parameters; mean values standard deviation values (a), and 
reference length N ref are loaded into memory at 231 and the monitored data are read into 
memory either via data acquisition (DAQ) hardware or from storage media depending on the 
choice made in step 228. 

The leading and trailing edges of each signature present in the monitored data are 
identified sequentially at 233 using the procedure described during the training phase step 217. 
The signatures are re-sampled to equalize their lengths in step 234 using the same procedure as 
in step 221. The similarity optimization at 235, 236 and 237 is used to line the monitored 
signature with the reference mean, \i calculated during step 225 of the training phase. A 
number of similarity measurement techniques may be used. In one embodiment of the 
invention, a bounded angle ratio test (BART) is used as the similarity measurement technique. 
The BART system is discussed in detail in U.S. Patent Application No. 09/373,326, 
incorporated herein by reference. It is also possible to use other systematic methods for the 
third step 520. For example, one could measure the distance between two Euclidean vectors as 
a possible technique. The details of the most preferred BART measurement technique are 
described below. 

The re-sampled and lined up signature is then differenced with the mean value vector to 
produce the residual vector R in step 238. In a particular embodiment of the invention, this is 
accomplished using a non-stationary sequential probability ratio test (SPRT). The SPRT 
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system is discussed in detail in U.S. Patent No. 5,223,207 and incorporated herein by 
reference. A SPRT decision ratio is then calculated to determine whether the monitored signal 
falls outside of normal operating conditions. This monitoring procedure can continue in real- 
time for the remainder of the operating run. Alternatively, the procedure can continue until a 
user decides to retrain the automated system. 

Parameter settings for the detection engine 240 are set manually before 
monitoring begins or are loaded from a stored data file that can be used over and over at step 
239. The results of the detection engine 240 are then processed in step 241 to determine the 
amount of deviation in the monitored signatures from the trained reference signature. The 
processing step produces an alert if the deviation is greater than a user specified amount 
(SFMp-positive deviation, SFMn-negative deviation, SFMs-standard deviation change) with a 
confidence level determined by specified false (a) and missed alarm (p) probabilities. The 
alert is then logged and/or displayed in the final step of the monitoring process at 242. 

As described above, a non-stationary sequential probability ratio test (SPRT) is 
preferably used to compare the adjusted monitored signal to the stored reference signals. In 
one example of the method, SPRT teaches a expert system and method to determine the 
degradation of nuclear reactant coolant pumps and their respective sensors prior to failure. 

FIG. 8. illustrates the architecture of the expert system for an online pump- 
surveillance system. The two coolant pumps 1 and 2 are each equipped with numerous sensors 
3-6. A typical sensor arrangement is depicted in FIG. 8 where seven sensors are employed: 
three sensors 3 which monitor the rotor shaft speed, two accelerometers 4 which monitor the 
mechanical vibration of the pump, a pump power measuring device 5 which measures the 
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power needed by the motor to turn the rotor, and a discharge pressure transducer 6 which 
measures the flow rate of coolant through the pump. The information from the sensors 3-6 is 
transmitted to the data acquisition system 7 (DAS) which then interfaces with the artificial 
intelligence (AI) based inference engine 8. The AI inference engine 8 implements an 
operability logic algorithm illustrated in FIG. 7. The AI software for the inference engine 8 is 
supported by a layer of utility routines which perform generic functions such as loading 
external tables, providing access to shared knowledge base, activating interprocess 
synchronization, and performing network communication. Output from the AI engine 8 is 
integrated to a color-graphics display 9 in the reactor room and is multiplexed back to the data 
acquisition system 7 for archive backup storage. If the inference engine 8 detects a 
degradation in the pump or its sensors, a pump sensor failure alarm 10 or a pump disturbance 
alarm 11 is sounded. 

FIG. 7 illustrates a flow chart for determining the condition of the cooling 
pumps through the employment of a sequence of mathematical algorithms associated with a 
series of sequential probability ratio test (SPRT) modules. The input signals 12 are acted on 
mathematically by a sensitive pattern recognition technique, the sequential probability ratio test 
(SPRT). The use of the SPRT technique through several "if-then" steps provides for early 
annunciation of sensor operability or degradation of the coolant pump. Each of the modules 
13, 14, 15, and 16 employs the SPRT technique to determine the condition of the respective 
sensors for the purpose of determining whether a problem is sensor or pump related. The 
modules present in the expert system include a shaft speed SPRT module 13, a vibration level 
SPRT module 14, a power signal SPRT module 15, and a discharge pressure SPRT module 16. 
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Each SPRT module is connected to an audible alarm 17 which is sounded when a sensor 

degradation is determined. If no sensor degradation is determined the degradation is 

t 

determined to be due to the pump, and the pump disturbance alarm 11 sounds. 

The various recited SPRT modules monitor and compare the signals from two 
similar sensors which respond to a single parameter representing a physical condition 
associated with the pump. The purpose of this comparison is to identify subtle changes in the 
statistical quality of the noise associated with either signal when compared one to the other. In 
applications involving two or more reactor coolant pumps equipped with identical sensors, a 
SPRT monitor applied to the pumps will provide a sensitive annunciation of any physical 
disturbance affecting one of the pumps. If each of the pumps had only one sensor, it would be 
difficult for the SPRT technique to distinguish between a pump degradation event and a 
degradation of the sensor itself. However, when each pump is equipped with multiple, 
redundant sensors, the SPRT technique can be applied to pairs of sensors on each individual 
pump for sensor-operability verification. 

As is illustrated in the logic diagram of FIG. 7, the expert system is synthesized 
as a collection of "if-then" type rules. Each SPRT module processes and compares the 
stochastic components of the signals from two sensors that are ostensibly following the same 
physical process. If any physical disturbance causes the noise characteristics for either signal 
to change, that is, a larger variance, skewness, or signal bias, then the SPRT technique 
provides a sensitive and rapid annunciation of that disturbance while minimizing the 
probabilities of both false alarms and missed alarms. 
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The processor 18, of module 13, first interrogates the signals Nl and N2, 
representing the mean shaft speed for the coolant pumps 1 and 2, respectively. The mean shaft 
speed signal is obtained by averaging the outputs of the three RPM sensors 3 on each of the 
pumps 1 and 2. If a problem is identified in the comparison of Nl and N2, a sequence of 
SPRT tests is invoked to validate the three sensors on the pump 1, signified by Al, Bl, and 
CI. If one of those sensors is identified as degraded, an audible alarm 11 is actuated. If the 
three sensors on pump 1 are found to be operating within tolerance, then the three 
corresponding sensors on the pump 2 are tested. If all six sensors are confirmed to be 
operational, execution is passed to the next SPRT module which in this case is the SPRT 
module 14 which tests the vibration-level variable. If these sensors are found to be 
operational, then the testing is functionally shifted to the module 15 the power-signal variable, 
and then if it is found to be functioning properly to the module 16 the discharge-pressure 
variable. This sequential organization is illustrated in FIG. 7. If a problem is identified in any 
module, an audible alarm, 10, 11 or 17 is sounded in the reactor control room, and the 
operator can initiate a manual shutdown of the reactor to repair the identified problem. 

. The objective of the AI engine in the expert system is to analyze successive 
observations of a discrete process Y which represents a comparison of the stochastic 
components of two physical processes monitored by similar sensors. Let y k represent a sample 
from the process Y at time t. During normal operations with an undergraded physical system 
and with sensors that are functioning within specifications, the y kj should be normally 
distributed with means 0. If the two signals being compared do not have the same nominal 
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means due, for example, to differences in calibration, then the input signals will be pre- 
normalized to the same nominal mean values during initial operation. 

The specific goal of the A 1 engine is to declare system 1 or system 2 degraded 
if the drift in Y is sufficiently large that the sequence of observations appears to be distributed 
about means + M or - M, where M is a preassigned system distribution magnitude. The 
SPRT provides a quantitative framework that enables us to decide between two hypotheses, H 
and H2, namely: 

HI: Y is drawn from a Gaussian product distribution function (PDF) with 
means M and variance a 2 . 

H2: Y is drawn from a Gaussian PDF with mean O and variance a 2 . 

If it is supposed that HI or J2 is true, we wish to decide for HI or H2 with 
probability (1-p) or (1-ct) respectively, where a and p represent the error (misidentification) 
probabilities. 

From the theory described by Wald and Wolfowitz in "Optimum Character of 
the Sequential Probability Ratio Test, "Ann. Math. Stat., 19,326 (1948), the most powerful 
test depends on the likelihood ratio l n , where 

Probability of observed sequence given HI true. 

z = y u yi yn 

yxji^.^y n 

Probability of observed sequence given H2 true. 
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After n observations have been made, the sequential probability ratio is just the 
product of he probability ratio is just the product of the probability ratios for each step: 

l H ={PRl)-(PR2)-(PR3): . .(PR,) 
or 

i=i f(y,\n 2 

where F(y;\H) is the distribution of the random variables^. 

the Wald-Wolfowitz theory operates as follows: Continue sampling as long as 

A<\n<B 

(1) 

Stop sampling and decide HI as soon as In > B, and stop sampling and decide HI as 
soon as In < A. The acceptance thresholds are related to the error (misidentification) 
probabilities by the following expressions. 

l-a a 

(2) 

where 

a = probability of accepting H2 when HI is true (false alarm probability) 
P = probability of accepting// 2 when HI is true (missed alarm probability) 

Assuming the random variable y k is normally distributed, the likelihood that HI 
is true (mean M, variance a 2 ) is given by 

L{y x ,y 2 ,y^--y n \H\) = 

E 

k=l 
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£ ykM + 

k=l 



n 




E M 2 









Similarly for H2 (means o, variance a 2 ), 



H2) = 



(2*> 



n/2an 



exp 



/ n 

E yk 2 



(4) 



expressed as 



(5) 



The ratio of equations (3) and (4) gives the likelihood ratio In ; where \n is 



where 



In = exp E 

k=l 



M(M-2yk) ] 



combining equations 1, 2 and 5, and taking the natural logs, gives 



In\p/(l-a ) ] S =i 



M(M-2yk)<In[ (l-fi )a ] 



ri 



as 



SPRT } , 

k= 1 



M(M - 2yk) or 



M 

*■ T 



then the sequential sampling and decision strategy can be concisely represented 

If SPRT < In(pi\ - a) accept H2 
continue sampling 



If SPRT >/«[(!- pi a ] accept HI 
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The SPRT analysis formulated here cannot be applied directly to non-Gaussian 
signals. For applications to nuclear system signals contaminated by non-Gaussian noise, an 
attempt should first be made to pretreat the input signals with a normalizing transformation. 

For applications where (a) one requires a high degree of assurance that a system 
is functioning within specifications and (b) there is not a large penalty associated with false 
alarms, it is not uncommon to specify a B (missed alarm probability) that is much smaller than 
A (false alarm probability). In safety critical systems one may be more willing to incur a false 
alarm than a missed alarm. For applications where a large cost penalty is incurred with any 
false alarms, it is desirable to keep both A and B small. 

The trade-off that must be considered before one specifies arbitrarily small 
values for A and B is the effect this may have on the sensitivity and maximum decision time 
needed by the SPRT to annunciate a disturbance. The desired sensitivity of the SPRT is fixed 
by specification of M, the system disturbance magnitude. For a given value of M, the average 
sample number required to reach a decision is influenced by A and B and also by the variance 
associated with the signals being monitored. It takes longer to identify a subtle change in a 
process characterized by a low signal-to-noise ratio than in one with a high signal-to-noise 
ratio. 

The non-stationary version of the SPRT algorithm is a slightly modified version 
of Wald's SPRT. In the non-stationary case, the failure magnitude, M, reference signal (or 
mean), p., and the reference variance, cr 2 , are sample dependent. Therefore, the non-stationary 
SPRT equation becomes 
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SPRT(n) = SPRT(n-l) + M(w) 



a\n) 

where n = 1,2,...,L and L is the length of the length equalized signals. In this 
case, y(n) is the length of the equalized monitored signal, |i(n) is the corresponding reference 
signal generated during the training phase and a^n) is the variance of each point in ji(n). 

The bounded angle ratio test (hereinafter BART) mentioned above is employed 
in systems with more than two variables, as shown in FIG. 9. For example, BART can be 
used on an actual sensor signal exhibiting non- white characteristics, such as for example, on 
sensor signals from the primary pump #2 of the EBR-II nuclear reactor at Argonne National 
Laboratory (West) in Idaho. In such a case, the signal can be a measure of the pump's speed 
over a given amount of time. In such a situation, one can use a nonlinear multivariate 
regression technique that employs an N Dimensional Space (known in vector calculus 
terminology as hyper space) to model the relationships between all of the variables. This 
regression procedure results in a nonlinear synthesized estimate for each input observation 
vector based on the hyperspace regression model. The nonlinear multivariate regression 
technique is centered around the hyperspace BART operator that determines the element by 
element and vector to vector relationships of the variables and observation vectors, given a set 
of system data that is recorded during a time period when everything is functioning correctly. 

In the BART method described in FIG. 9., the method is also split into a 
training phase and a monitoring phase. The first step in the training phase is to acquire a data 
matrix continuing data samples from all of the sensors (or data sources) used for monitoring 
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the system that are coincident in time and are representative of normal system operation. Then 
the BART parameters are calculated for each sensor (Xmed, Xmax and Xmin). Here Xmed is 
the median value of a sensor. The next step is to determine the similarity domain height for 
each sensor (h) using the BART parameters Xmed, Xmax and Xmin. Once these parameters 
are calculated a subset of the data matrix is selected to create a model matrix (H) that is used in 
the BART estimation calculations. Here, H is an NxM matrix where N is the number of 
sensors being monitored and M is the number of observations stored from each sensor. The 
last steps taken during the training phase are the SPRT parameters calculations. The 
calculations are analogous to the calculations in the other methods, except that now the 
standard deviation value used to calculate SDI is obtained from BART estimation errors from 
each sensor (or data source) under normal operating conditions. 

During the BART monitoring phase, a sample vector is acquired at each time 
step t, that contains a reading from all of the sensors (or data sources) being used. Then the 
similarity angle (SA) between the sample vector and each sample vector stored in H is 
calculated. Next an estimate of the input sample vector Y is calculated using the BART 
estimation equations. The difference between the estimate and the actual sensor values is then 
used as input to the SPRT module. Each difference is treated separately so that a decision can 
be made on each sensor independently. This method is described in more detail hereinafter. 

In this preferred embodiment of FIG. 9 of the invention, the method measures 
similarity between scalar values. BART uses the angle formed by the two points under 
comparison and a third reference point lying some distance perpendicular to the line formed by 



1 



20 



the two points under comparison. By using this geometric and trigonometric approach, BART 
is able to calculate the similarity of scalars with opposite signs. 

In the most preferred form of BART an angle domain must be determined. The 
angle domain is a triangle whose tip is the reference point (R), and whose base is the similarity 
domain. The similarity domain consists of all scalars which can be compared with a valid 
measure of similarity returned. To introduce the similarity domain, two logical functional 
requirements can be established: 

0 The similarity between the maximum and minimum values in the similarity domain is 0, 
and 

1 the similarity between equal values is 1 . 

Thus the similarity range (i.e. all possible values for a measure of similarity), is in the range 0 
to 16 inclusive. 

BART also requires some prior knowledge of the numbers to be compared for 
determination of the reference point (R). Unlike a ratio comparison of similarity, BART does 
not allow "factoring out" in the values to be compared. For example, with the BART 
methodology the similarity between 1 and 2 is not necessarily equal to the similarity between 2 
and 4. Thus, the location of R is vital for good relative similarities to be obtained. R lies over 
the similarity domain at some distance h, perpendicular to the domain. The location on the 
similarity domain at which R occurs (Xmed) is related to the statistical distribution of the 
values to be compared. For most distributions, the median or mean is sufficient to generate 
good results. In a preferred embodiment the median is used since the median provides a good 
measure of data density and is resistant to skewing caused by large ranges of data. 
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Once Xmed has been determined, it is possible to calculate h. In calculating h, 
it is necessary to know the maximum and minimum values in the similarity domain. (Xmax 
and Xmin respectively) for normalization purposes the angle between Xmin and Xmax is 
defined to be 90°. The conditions and values defined so far are illustrated in FIG. 10. From 
this triangle it is possible to obtain a system of equations and solve for h as shown below: 

c - Xmed - Xmin 
d - Xmax - Xmed 
a 2 - c 2 + h 2 

b 2 -d 2 + h 2 (19) 

(c+d) 2 -a 2 + b 2 

(c+d) 2 -c 2 + d 2 + 2h 2 

h 2 -cd 

h-cd 

Once h has been calculated the system is ready to compute similarities. Assume 
that two points: Xq and X, (Xq < X x ) are given as depicted in FIG. 11 and the similarity 
between the two is to be measured. The first step in calculating similarity is normalizing Xq 
and X, with respect to Xmed. This is done by taking the euclidean distance between Xmed and 
each of the points to be compared. Once X 0 and X, have been normalized, the angle ZXoRXj 
(hereinafter designated 0) is calculated by the formula: 

G = ArcTan(X, |h) = ArcTan(Xo|h) (20) 

After 0 has been found, it must be normalized so that a relative measure of 
similarity can be obtained that lies within the similarity range. To ensure compliance with 
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functional requirements (A) and (B) made earlier in this section, the relative similarity angle 
(SA) is given by: 

(21) 

SA = 1- 8 
90° 

Formula (21) satisfies both functional requirements established at the beginning 
of the section. The angle between Xmin and Xmax was defined to be 90°, so the similarity 
between Xmin and Xmax is 0. Also, the angle between equal values is 0° The SA therefore 
will be confined to the interval between zero and one, as desired. 

To measure similarity between two vectors using the BART methodology, the 
average of the element by element SAs are used. Given the vectors x x and x 2 the SA is found 
by first calculating S s for i = 1,2,3. ..n for each pair of elements in x } and x 2 i.e., 

if Xi = [X n X 12 X 13 ...X 1 Jand 2b = [X 21 X 22 X 23 ... X 2n ] 

The vector SA T is found by averaging over the Si's and is given by the following 
equation. 



r= I Z s > < 22 > 
n i=i 

In general, when given a set of multivariate observation data from a process (or 
other source of signals), linear regression could be used to develop a process model that relates 
all of the variables in the process to one another. An assumption that must be made when 
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using linear regression is that the cross-correlation information calculated from the process 
data is defined by a co variance matrix. When the cross-correlation between the process 
variables is nonlinear, or when the data are out of phase, the covariance matrix can give 
misleading results. The BART methodology is a nonlinear technique that measures similarity 
instead of the traditional cross-correlation between variables. One advantage of the BART 
method is that it is independent of the phase between process variables and does not require 
that relationships between variables be linear. 

If there is a random observation vector y and a known set of process observation 
vectors from a process P, it can be determined if y is a realistic observation from a process P 
by combining BART with regression to form a nonlinear regression method that looks at 
vector SAs as opposed to euclidean distance. If the know observation vectors taken from P are 
given by 
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(23) 



H = 



(h, 



v 



Yin 



^1 \ 



him 
him 



[ hi h2 h3 ] 

where H is k by m (k being the number of variables and m the number of observations), 
then the closest realistic observation vector to y in process P given H is given by 

y = Hw (24) 
Here w is a weighting vector that maps a linear combination of the observation vectors 
in H to the most similar representation of y. The weighting vector w is calculated by 
combining the standard least squares equation form with BART. Here 0 stands for the SA 
operation used in BART. 



w = (H' © H)" H' © y 



(25) 



An example of use of the BART methodology was completed by using 10 EBR- 
II sensor signals. The BART system was trained using a training data set containing 1440 
observation vectors. Out of the 1440 observation vectors, 129 of these were chosen to be used 
to construct a system model. The 129 vectors were also used to determine the height, h, of the 
angle domain boundary as well as the location of the BART reference point R for each of the 
sensors used in the experiment. To test the accuracy of the model 900 minutes of one minute 
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data observation vectors under normal operating conditions were run through the BART 
system. The results of the BART system modeling accuracy are shown in FIGS. 12-16 and 
FIGS. 17-21 (BART modeled). The Mean Squared Errors for each of the sensor signals is 
shown in Table III. 



TABLE III 


BART System Modeling Estimation Mean Squared Errors i 


for EBR-II Sensor Signals 












Sensor 


Sensor Description 


MSE of 


Normalized 


Normalized 


Channel 




Estimation 


MSE 


MSE 






Error 


(MSE/H,) 


(MSE/o,) 


1. 


Primary Pump #1 Power (KW) 


0.0000190 


0.0000002 


0.0002957 


2. 


Primary Pump #2 Power (KW) 


0.0000538 


0.0000004 


0.0004265 


3. 


Primary Pump #1 Speed (RPM) 


0.0000468 


0.0000001 


0.0005727 


4. 


Primary Pump #2 Speed (RPM) 


0.0000452 


0.0000001 


0 0004571 


5. 


Reactor Outlet Flowrate (GPM) 


8.6831039 


0.0009670 


0.1352974 


6. 


Primary Pump #2 Flowrate (GPM) 


0.0571358 


0.0000127 


0.01.63304 


7. 


Subassembly Outlet Temperature 1A1 (F) 


0.0029000 


0.0000034 


0.0062368 


8. 


Subassembly Outlet Temperature 2B1 (F) 


0.0023966 


0.0000027 


0.0052941 


9. 


Subassembly Outlet Temperature 4E1 (F) 


0.0025957 


0.0000029 


0.0050805 


10. 


Subassembly Outlet Temperature 4F1 (F) 


0.0024624 


0.00O0028 


0.00 1358 



A second example shows the results of applying BART to ten sensors signals 



with three different types of disturbances with their respective BART estimates superimposed 
followed by the SPRT results when applied to the estimation error signals. The first type of 
disturbance used in the experiment was a simulation of a linear draft in channel #1. The drift 
begins at minute 500 and continues through to the end of the signal, reaching a value of 0.21 % 
of the sensor signal magnitude and the simulation is shown in FIG. 22 A. The SPRT (FIG. 2B) 
detects the drift after it has reached a value of approximately 0.06% of the signal magnitude. 
In FIG. 22C a simulation of a step failure in channel #2 is shown. Here the step has a height 
of 0.26% of the signal magnitude and begins at minute 500 and continues throughout the 
signal. FIG. 22D shows the SPRT results for the step failure. The SPRT detects the failure 
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immediately after it was introduced into the signal. The last simulation was that of a sinusoidal 
disturbance introduced into channel #6 as shown in FIG. 22E. The sinusoid starts at minute 
500 and continues throughout the signal with a constant amplitude of 0.15% of the sensor 
signal magnitude. The SPRT results for this type of disturbance are shown in FIG. 22F. 
Again the SPRT detects the failure even though the sinusoid's amplitude is within the operating 
range of the channel #6 sensor signal. 

While preferred embodiments have been shown and described, it should be 
understood that changes and modifications can be made therein without departing from the 
invention in its broader aspects. For example, it is possible that signals or waveforms could be 
measured from processes other than those in the manufacturing or biological fields. 
Additionally, there are many comparison techniques that could be used to correlate and 
compare the signals measured according to this invention. Various features of the invention 
are defined in the following Claims. 
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